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Abstract 

Nonresponse is present in almost all surveys and can severely bias estimates. It is usually 
distinguished between unit and item nonresponse: in the former, we completely fail to have 
information from a unit selected in the sample, while in the latter, we observe only part of the 
information on the selected unit. Unit nonresponse is usually dealt with by reweighting: each 
unit selected in the sample has associated a sampling weight and an unknown response proba- 
bility; the initial sampling weight is multiplied by the inverse of estimated response probability. 
Item nonresponse is usually dealt with by imputation. By noting that for a particular survey 
variable, we just have observed and unobserved values, in this work we exploit the connection 
between unit and item nonresponse. In particular, we assume that the factors that drive unit 
response are the same as those that drive item response on selected variables of interest. Re- 
sponse probabilities are then estimated by using a logistic regression with a latent covariate 
that measures such will to respond and that can explain part of the unknown behavior of a unit 
to participate in the survey. The latent covariate is estimated using latent trait models. Such 
approach is particularly relevant for sensitive items and, therefore, can handle non-ignorable 
nonresponse. Auxiliary information known for both respondents and nonrespondents can be 
included either in the latent variable model or in the logistic model. The approach can be also 
used when auxiliary information is not available, and we focus here on this case. The theoret- 
ical properties of the proposed estimators are sketched and simulations studies are conducted 
to illustrate their finite size sample performance. 

Key words: unit nonresponse, item nonresponse, latent trait models, response propensity, 
non-ignorable nonresponse. 

1 Introduction 

Nonresponse is an increasingly common problem in surveys. It is a problem because, firstly, it 
causes missing data. It is usually distinguished between two types of missing data: unit nonresponse, 
when for a selected sample element the information is missing on all variables surveyed with the 
questionnaire, and item nonresponse, when the information is missing on at least one, but not 
all the survey variables. Nonresponse is a problem, secondly and more importantly, because such 
missingness is a potential and likely source of bias for survey estimates. 
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Randomness in survey sampling is guaranteed by a probabilistic sampling design. In the pres- 
ence of unit nonresponse, another source of randomness is added, which is generated by an un- 
known response mechanism. The sampling design associates to each unit in the population an 
inclusion probability, whose inverse, for sampled units, determines its sampling weight. Each unit 
in the population has also associated a response probability to respond to the survey. The response 
probability is unknown and several methods have been proposed to estimate it either explicitly, 
using response propensity modeling like logistic regression models (see e.g. Kim and Kim, 2007), 
or implicitly, using response homogeneity groups or more generally calibration (see Sarndal and 
Lundstrom, 2005, for an overview). Once estimates are computed, a commonly used method to 
deal with unit nonresponse is reweighting: sampling weights of the respondents are adjusted by 
the inverse of the estimated response probability providing new weights. The estimation of the 
response probabilities typically requires the availability of auxiliary information, either in the form 
of the value of some auxiliary variables for all units in the originally selected sample or of their 
population mean or total. 

Three types of nonresponse are commonly considered in survey practice (see Little and Ru- 
bin, 1987): missing completely at random (MCAR), when the response indicator to the survey is 
independent of all other variables in the survey; missing at random (MAR), when the response 
indicator depends only on some characteristics observed in the survey and available also for nonre- 
spondents; missing not at random (MNAR), when the response indicator depends on characteristics 
of interest observed only on the respondents or completely unobserved. Missing data mechanisms 
assuming an MCAR or a MAR response mechanism are called ignorable; all other mechanisms are 
non-ignorable. 

We focus here on non-ignorable nonresponse, that is typical of surveys with sensitive questions 
(concerning drug abuse, sexual attitudes, politics, income etc). Various approaches have been 
proposed in survey sampling literature to deal with non-ignorable nonresponse. These approaches 
can be roughly divided into likelihood based methods and reweighting methods. Note that all 
these methods make use of observed auxiliary information. Survey problems with non-ignorable 
nonrespondents have been discussed e.g. by Greenlees et al. (1982), Little and Rubin (1987), 
Beaumont (2000), Qin et al. (2002), Zhang (2002). Copas and Farewell (1998) introduce into 
the British National Survey of Sexual Attitudes and Lifestyles, in addition to the variables of 
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interest, a variable called 'enthusiasm-to-respond' to the survey, which is expected to be related 
to probabilities of unit and item response. They propose a method to estimate these probabilities 
using this variable to achieve unbiased estimates of population parameters. An approach based 
on the use of latent variables for modeling nonignorable nonresponse is given in Biemer and Link 
(2007), that extend the ideas in Drew and Fuller (1980) and use a discrete latent variable based 
on call history data available for all sample units. The latent variable is computed using some 
indicators of level of effort based on call attempts. 

We focus on unit nonresponse and propose a method of reweighting to reduce nonresponse 
bias in the case of non-ignorable nonresponse. The method does not require the availability of 
auxiliary information, on the sample or population level, but different assumptions are made. 
First, it is assumed that item nonresponse is present in the survey and that it affects m variables 
of particular interest. Thus a response indicator can be defined for each variable (item) £, for 
I = 1, . . . , m, taking value 1 if item I is observed on unit k and otherwise. Next, the response 
indicators are manifestations of an underlying continuous scale which determines a latent variable 
that is related to the response propensity of the units and to the variable of interest. It is possible 
to compute such latent variable for all units in the sample, not only for the respondents, and thus 
to use it as auxiliary information in a response probability estimation procedure. The outcome 
of this estimation procedure can finally be used in a reweighting fashion. No information about 
the nonrespondents and no auxiliary information (like sex, age, marital status) are required in the 
proposed method. 

The use of continuous latent variables to model item nonresponse is also considered in Moustaki 
and Knott (2000). In this paper, we take a different perspective and use latent variable models 
to address non-ignorable unit nonresponse also when auxiliary information is not available. We 
propose to use a latent variable called here 'will to respond to the survey', which is expected to 
be related to the probability of unit response, similar to the case of the 'enthusiasm-to-respond' 
variable as defined by Copas and Farewell (1998). This latent variable is suggestible to be computed 
in attitude surveys or surveys where the income is one of the variable of interest. As we have 
mentioned, our method does not need to use any auxiliary information about respondents and non- 
respondents, but we link unit nonresponse to item nonresponse via a latent variable and use this 
variable exactly like an auxiliary information in the process of estimating the response probabilities. 
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Following Moustaki and Knott (2000), 'weighting through latent variable modeling is expected to 
perform well under non-ignorable nonresponse where conditioning on observed covariates only is 
not enough.' Moreover, in the absence of any covariate, we expect that an estimator based on the 
proposed weighting system with latent variable will perform better in terms of bias reduction than 
the naive estimator computed on the set of respondents. 

The paper is organized as follows. Section 2 introduces the survey framework and notation, 
and reviews some classical estimators in the presence of nonresponse. In Section 3, we present 
how we can use latent variables as auxiliary information in nonresponse adjusted estimators. In 
Section 4, the practical properties of the proposed nonresponse adjusted estimators are evaluated 
via simulations. In Section 5, we summarize our conclusions. 

2 Framework 

Let U be a finite population of size N, indexed by k from 1 to N. Let s denote the set of sample 
labels, so that s C U. The sample size is denoted by n. The sample s is drawn using a probabilistic 
sampling design p(s) such that 

p( s ) > 0, for all s G S and ^^p(s) = 1, 

where S is the set of all samples s of size n. Let 7Tfe = ~^2 s . sB kP( s ) be the probability of including 
unit k in the sample. It is assumed that Wk > 0, k = 1, . . . , N. Not all units selected in s respond 
to the survey. Denote by r C s the set of respondents, and by f = s \ r the set of nonrespondents. 
The response mechanism is given by the distribution q(r\s) such that for every fixed s we have 

q(r\s) > 0, for all r G 1Z S and q{r\s) = 1, 

sen s 

where 1Z S = {r\r C s}. 

Under unit nonresponse we define the response indicator Rk = 1 if unit k G r and if k G f. 
Thus r = {k G s\Rk = 1}- We assume that these random variables are independent of one another 
and of the sample selection mechanism; this is the pseudo-randomization model of Oh and Scheuren 
(1983). Since only the units in r are observed, a response model is used to estimate the probability 
of responding to the survey, pk = P(k G r\k G s) = P(Rk = G s), which is a function of the 
sample and must be positive. It follows that Rk are independent Bernoulli variables with expected 
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value equal to pk- We assume that the units respond independently of each other and of s and so 

q(r\s) = Y[p k -p k ). 

Suppose that in the survey there are m survey variables (items) of particular interest. Each 
respondent is exposed to these m questionnaire items, labelled I = 1, . . . , m. Suppose that the goal 
is to estimate the population total of the variables of interest and, in particular, of the variable of 
interest yj, i.e. Yj — J2k=i V k h wrtn Ukj being the value taken by yj on unit k. In the ideal case, if 
the response distribution q(r\s) is known, then the p^'s would be known and available to estimate 
Yj in a two-phase fashion. Thus, the combined weights \j{^kPk) could be computed for all k e r 
and used in the following Horvitz-Thompson (HT) estimator 

ker 7TkPk 

It is unbiased for Yj, under the quasi-randomization approach of Oh and Scheuren (1983), where 
the statistical properties are evaluated using the joint distribution of the sampling design and the 
response mechanism. Since the p^s are unknown in practice, 5^,2 P ht cannot be computed. 
The naive estimator 

kEr k£r 

does not adjust for nonresponse and is sometimes computed in surveys. Following Saxndal and 
Lundstrom (2005, p. 68), its bias can very large, but " may still find use if there is no better auxiliary 
information and/or if there is strong reason to believe that the nonresponse occurs at random." 
When this is not the case, the pkS must be estimated and a nonresponse adjusted estimator is 
constructed by replacing pk with an estimate % in (1). One obtains a Horvitz-Thompson type 
estimator defined by 

^.HT=E^- ( 3 ) 

Different methods to compute % have been proposed in the literature. All of these methods are 
based on the use of auxiliary information known on the population or sample level. 

In the case of non-ignorable nonresponse the variable of interest is itself the cause (or one 
of the causes) of the response behavior, and a covariance between the former and the response 
probability is produced through a direct causal relation (see Groves, 2006). In such a case, the 
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response probability pk should be modeled for k € s using logistic regression as follows 

Pk = P(R k = l\y kj ) = — r^— 7T, (4) 

1 + exp(-(a + a 1 y kj )) 

or as follows 

p k = P(R k = l|3//y,z fc ) = — — \ ■ j tt-, (5) 

1 + exp(-(a + aiy k j +z' fe a)) 

where z k — (zki, • • • , z kt )' is a vector with the values taken by t > 1 covariates on unit k. and a , 

ai and a are parameters. 

Nonresponse bias in the unadjusted respondent total of the variable of interest yj depends on 

the covariance between the values y k j and p k (see Bethlehem, 1988). An example of covariate that 

reduces the covariance between y k j and p k is the interest in the survey topic, such as knowledge, 

attitudes, and behaviors related to the survey topic (see Groves et al., 2006). The set of covariates 

Zfc should be also related to the variable of interest yj to reduce sampling variance (Little and 

Vartivarian, 2005). 

Since y k j is only observed on the respondents, Models (4) and (5) cannot be estimated. There- 
fore, usually, the values of z k that are known for both respondents and nonrespondents and are 
related to the 2/jy's by a 'hopefully strong regression' (Cassel et al., 1983) are used in the following 
model 

Pk = P(R k = l|z fc ) = — T^——, u 1 (6) 

1 + exp(-(a + z' k a)) 

Then, maximum likelihood can be used to fit Model (6) using the data (Rk, Zfc) for k G s. This leads 
to estimate So and S and to the estimated response probabilities pk = 1/(1 + cxp((— (a + z' k a) j) 
to be used in (3). Such procedure provides some protection against nonresponse bias if Zk is a 
powerful predictor of the response probability and/or of the variable of interest (Kim and Kim, 
2007). 

In what follows we propose an adjustment weighting system based on an auxiliary variable that 
measures the propensity of each unit to participate to the survey. To this end, further assumptions 
on the response model are introduced in order to assume a dependence of the p k 's on one latent 
auxiliary variable that is connected to the propensity scores of Rosenbaum and Rubin (1983). The 
proposed approach can be used when no other auxiliary information is available on k e s. 
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3 Latent variables as auxiliary information 

To obtain a measure of response propensities, we consider the case in which item nonresponse 
on the variables of interest is also present. Then, following Chambers and Skinner (2003, p. 278): 
'from a theoretical perspective the difference between unit and item nonresponse is unnecessary. 
Unit nonresponse is just an extreme form of item nonresponse', we assume that item response 
on the variables of interest is driven on respondents by the same attitude and factors that drive 
unit response. Then, latent variable models can be used to estimate such factors and use them as 
covariates in a logistic response model. Suppose that item nonresponse affects the aforementioned m 
survey variables (items) of particular interest. Each respondent is exposed to such to questionnaire 
items, labelled £ = 1, . . . , m. A second response indicator is introduced for each item t. For each 
item I and each unit k, a binary variable x kl is defined that takes value 1 if unit k answers to item 
I and otherwise. Let Xfc = (xti, ■ ■ ■ ,Xkt, ■ ■ ■ ,Xkm)' denote the vector of response indicators for 
unit k to the m items and let = (yki, ■ ■ ■ yu, ■ ■ ■ , Vkm)' be the study variable vector for unit k. 
Thus yki is the response value of unit k to item I and xti is the response indicator to the same 
item. 

Suppose the Xkt's are related to an assumed underlying latent continuous scale; they are the 
indicators of a latent variable denoted by 6k- De Menezes and Bartholomew (1996) call the variable 
9k the 'tendency to respond' to the survey. We call it here the 'will to respond to the survey' of 
unit k. A latent trait model with a single latent variable is used to compute 8k for each k € s (we 
will see later how; see Section 3.2). Assume for now that 9k is known on all sample units and, as 
with usual auxiliary information, can be used as a covariate. Without any other covariate, Model 
(6) is rewritten as 

Pk = P(R k = i\e k ) = — — , / - , .. . (7) 

1 + exp(-(a + a\6k)) 

The covariate 9k can be viewed as a variable explaining the behavior related to the survey topic, 
and thus having good properties to reduce the covariance between ykj and pk and the nonresponse 
bias (see Groves et al., 2006). 

Note that, in Model (7) auxiliary information takes the form of a latent variable. Of course, in 
case other suitable auxiliary information is available, it can be inserted in the model as supplemen- 
tary covariates. Now, to estimate the parameters of Model (7) the value of 9k has to be available 
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for all units in the sample. The following sections provide details on how to obtain estimated values 
of 9 k for both respondents and nonrespondents. 

3.1 Computing response propensities using latent trait models 

The variable 9k can be computed using a latent trait model. In general, latent variable models 
are multivariate regression models that link continuous or categorical responses to unobserved 
covariates. A latent trait model is a factor analysis model for binary data (see Bartholomew et al., 
2002; Skrondal and Rabe-Hesketh, 2007). 

We start by creating the matrix with elements {xke}kes-e=i,...,m- Figure 1 shows a schematic 
of the indicators xut for respondents and nonrespondents. 
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Figure 1: Schematic representing variables xu for the sets r and r 



We assume that the factors that drive unit response are the same as those that drive item 
response on selected variables of interest. That is item nonresponse is assumed nonignorable. Let 
qu be the probability of response of unit k for item £, for all£ = 1, . . . , m and k e r. As in the case 
of unit nonresponse, q k e should be modelled as a function of the variable of interest using logistic 
regression as follows 

1 



q u = P{x M = l\yu,0 k ) = 



(8) 



1 + exp(-(ft + PeiOk + Pi2Vkt)) ' 
for i = 1, . . . , m, and k € r. where /3 £, ftu ancl hi are parameters. Since yu is known only for 
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units with x k e = 1, k G r, Model (8) cannot be estimated. As in the case of unit nonresponse, we 
propose to estimate q M as a function of an auxiliary variable related to the variable of interest, 
that is 9k- Model (8) is rewritten 

qu = — — \ — for I = 1, . . . , m and k e r. (9) 

Note that Model (9) is not an ordinary logistic regression model, because the 0fc's are unobservable 
values taken by a latent variable. Latent trait models can be used in this case to estimate q k e, 9 k and 
the model parameters. Note that in the area of educational testing and psychological measurement, 
latent trait modelling is termed Item Response Theory. 

The Rasch model (Rasch, 1960) is a first simple latent trait model that is well known in the 
psychometrical literature and used for analyzing data from assessments to measure variables such 
as abilities and attitudes. It takes the following form 

q M = — — — — for I = 1, . . . , m and k e r. (10) 

1 + exp(-(/3«) + /3i9k)) 

The parameters Pe are estimated for each item I and reflect the extremeness (easiness) of item 
I: larger values correspond to a larger probability of a positive response at all points in the latent 
space. The parameter fi\ is known as the 'discrimination' parameter and can be fixed to some 
arbitrary value without affecting the likelihood as long as the scale of the individuals' propensities 
is allowed to be free. In many situations the assumption that item discriminations are constant 
across items is too restrictive. The two-parameter logistic (2PL) model generalizes the Rasch model 
by allowing the slopes to vary. Specifically, the 2PL model assumes the form given in equation (9). 
The parameters fin are now estimated for each item t and are typically assumed to be positive. 
They provide a measure of how much information an item provides about the latent variable 9 k . A 
further generalization to Model (9) is considered in the literature - the 3PL model - that includes 
another parameter, the guessing parameter, to model the probability that a subject with a latent 
variable tending to — oo responds to an item. Such extension does not seem necessary in the context 
at hand and will not be considered further. 

Latent trait models typically rely on the following assumptions. The first one is the so-called 
conditional independence assumption, which postulates that item responses are independent given 
the latent variable (i.e. the latent variable accounts for all association among the observed variables 
xe). Following Bartholomew et al. (2002, p. 181) 'the assumption of conditional independence can 



only be tested indirectly by checking whether the model fits the data. A latent variable model is 
accepted as a good fit when the latent variables account for most of the association among the 
observed responses.' 

A second assumption of Models (10) and (9) is that of monotonicity: as the latent variable 9k 
increases, the probability of response to an item increases or stays the same across intervals of 9k- 
In other words, for two values of 9k, say a and b, and arbitrarily assuming that a < b, monotonicity 
implies that P{xkt — l|#fc = a) < P{xkt = l|#fc = b) (or £ = l,...,m. Larger values of 9k are 
associated with a greater chance of a response to each item. 

Finally, the third, and possibly strongest, assumption of Models (10) and (9) is that of unidi- 
mensionality, implying that a single latent variable fully explains the willingness of unit k to answer 
the questionnaire. All these basic assumptions imply that the dependence between the variables 
Xki may be explained by the latent variable 9k which represents the units' willingness and that the 
probability that an unit k responds to a given item increases with 9k- 

In what follow we use the two-parameter logistic (2PL) model given in (9). Let (3 = {Piq, /?«)'• 
Usually the following distributional assumption on 9k is made: 9k ~ N(0, 1). Given 9k, the response 
indicators xu are independent and their conditional probability for a given 9k and (3 is 

m m 

.f( Xk \9k,(3) = Hfe(x ki \6 k ,p) = n«« w (l - Qki) 1 ^', 
e=i e=i 

where ft is the conditional probability of response indicator xu for the ^-th item. The likelihood 
of Xfc is given by 

/oo 
f(xk\9k,f3)h(9k)d9 k , 
-OO 

where h is the standard normal density function. Conditional to the estimated value of the 
probabilities qu are estimated by the model using 

/oo 
Mx M \9k,/3)h(9k)d9k- (11) 
-oo 

Different goodness-of-fit measures have been proposed in the literature (see Bartholomew 
et al., 2002). One of them is based on two-way and three-way margins of the response items. 
Discrepancies between the expected (E) and observed (0) counts in these tables are measured 
using the statistic (O — E) 2 /E. Response items or pairs of response items where the model does 
not fit well are revealed using the table margins. Note that the residuals (O — E) 2 / E are not 
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independent and they cannot be summed to give an overall test statistics distributed as a chi- 
squared (see Bartholomew et al., 2002, p. 186). This measure of goodness-of-fit is used in Section 
4. 

Assuming a simple random sampling (srs) without replacement of size n, Model (9) can be fit- 
ted by maximum likelihood, maximizing the following function assuming conditional independence 
among the response indicators Xkf- 

ln(£) = £ln(/(x fe )). 

Bartholomew and Knott (1999) discuss an extended EM algorithm to estimate Model (9). In the 
case of non-srs, where sampling weights Wk are available for all respondents, a sample-weighted 
pseudo-log-likelihood can be defined as 

^2 w k ln(/(x fe )). 

k£r 

Maximizing the pseudo- log- likelihood simultaneously with respect to fito,0ei, and xu provides 
design-consistent estimates of the underlying population parameters under suitable conditions (Pf- 
effermann, 1993). 

Estimates 9k, fleo and (in of 9k, (3m and (3a, respectively, are provided fitting Model (9). We 
use the estimate 9k as covariate in Model (7) instead of the unknown value of 9k- Model (7) is 
rewritten 

Pk = P(R k = l\9 k ) = — gr-. (12) 

1 + exp(-(a + aiffh)) 

It is important to test whether another latent variable may describe the responding willingness 
of units in a survey. In particular, other than the 'will-to-respond to the questionnaire', there may 
be also an 'ability' to answer to some items of the questionnaire for unit fc, given, for example, by 
being more or less informed on the topic surveyed by those items. Model (9) can be extended to 
fit latent trait models with several latent variables, allowing also for the incorporation of nonlinear 
terms between them as discussed in Rizopoulos and Moustaki (2008). With two latent variables 
9k, 7fe and an interaction term, Model (9) is rewritten as 

m = 1 + exp (-(/?«, + (3 n 9 k + /W + &30fc7k)) ' (13) 
An alternative solution to estimate pk is investigated in Matei (2010). Let Sk = Y^T=i Xki ^ e 
the raw score for unit fc, i.e. the number of items unit k has responded to: if fc e f, then Sk = 0; 

11 



if k e r, then Sfc > 0. Then can be estimated by modelling P(S k > 0). By the conditional 
independence assumption we have 

P(S k >0) = l-P(S fc = 0) 

= 1-P(n£ 1 (a: w = 0)) 

m 

1=1 

where Xki ~ Bernoulli(qki). The response probability pfc is then estimated by 

m 

pit = i - n^ 1 ~ 

£=1 

where is computed using Model (9), for I = 1, . . . , m. This solution has, however, a drawback: if 
there exists at least one qu close to 1, then % is close to 1, and there is no effect in the reweighting 
system. 

3.2 Computing k for non-respondents 

To compute a value 9k for k € f, we assume again that unit nonresponse is just an extreme form of 
item nonresponse. Thus, a nonrespondent does not answer to any item I and thus x^£ = 0, for all 
£ = 1, . . . ,77i. Model (9) allows the computation of Ok for this particular case, providing the same 
value of 9k for all € f. Thus, Model (9) allows the computation of 9k for all fees. This is a very 
important feature for the estimation of the response probabilities pk using Model (12). 

Sometimes the problem of separation is present, and Model (12) cannot be fitted directly. The 
phenomenon of separation is observed in the fitting process of a logistic regression model when 
the likelihood converges to a finite value while at least one parameter estimate diverges to (plus or 
minus) infinity. Separation occurs here since the zero cases (Rk = for G f) have the same value 
of the covariate 9k- To overcome this problem two practical solutions can be used: 

1. add a jitter to the value of 9k (some noise to 9k for the cases with Rk — 0), 

2. add some artificial cases with Rk — and with different values of 9k randomly generated 
from N(0, 1) distribution and then conduct the analysis in the usual fashion on the resulting 
data (see also Clogg et al., 1991). 
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3.3 Adjustment for item and unit nonresponse and corresponding esti- 
mators 

Recall that we have a variable of particular interest yj and that item nonresponse is present for 
it. Let r y = {k G r\xkj = 1} be the set of respondents for variable yj. If we wish to estimate the 
population total Yj — J2k=i Vkj °f Vji then a naive estimator that does not correct neither for unit 
nor for item nonresponse is given by 

CT! 7Tk r-c nk 

Imputation is often used to handle item nonresponse in survey practice. It consists in replacing 
missing values of items by imputed values. The naive estimator that uses imputed values y k - of 
ykj when Xkj — is given by 

n'Zvc = w( £ ff + £ d») 

k£r y k£r\r v k£r 

Reweighting item responders is another approach to handle item nonresponse. Moustaki and 
Knott (2000) propose to weight item responders by the inverse of the fitted probability of item 
response qu, assuming qke > 0. Therefore, a possible adjustment weight for item and unit nonre- 
sponse associated to unit k € r y is given by 1 / (pkqkj) ■ By considering different combinations of 
the aforementioned methods we propose the following estimators: 

• a three-phase estimator adjusted for item and unit nonresponse via reweighting, 

• a two-phase estimator estimator adjusted for unit nonresponse via reweighting and for item 
nonresponse using imputation {y k - for ykj when Xkj = 0), 

oim P= ^+ V *kr; (17) 

k£r v k€r\r y 

• a sort of model assisted compromise between estimators (16) and (17) given by 

p-imp = Vkj ykj ~ Vkj , lg s 

This estimator is inspired by that proposed in Kim and Park (2006) in the context of ratio 
imputation. 
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The properties of the proposed estimators (16)-(18) depend on the assumptions about the unit 
and the item nonresponse mechanisms. In particular, all three estimators assume a second phase 
of sampling with unknown response probabilities. If we ignore estimation of 9 k in Model (12), 
the results in Kim and Kim (2007) on consistency of the two-phase estimator that uses estimated 
response probabilities hold here as well when considering maximum likelihood estimates for the 
parameters a a and ot\. 

As of the imputation process, estimator Yj tPq in (16) follows a fully weighted approach (see 
e.g. Sarndal and Lundstrom, 2005, Chap. 12) and is a three-phase estimator with estimated unit 
and item response probabilities. Again, ignoring estimation of the latent variable 9k and using 
conditional maximum likelihood estimates for the parameters fyo and /3a in Model (9), estimator 
Yj^p g will be consistent if the models for unit and item nonresponse probabilities are correctly 
specified. On the other hand, estimator Y"™ p in (17) follows a combined approach with reweighing 
for unit nonresponse and imputation for item nonresponse. In this case, the estimator will be 
consistent if the unit response probability model and the imputation model assumed to obtain 
predicted values j/£ . both hold. Finally, estimator Y™^ takes a sort of model-assisted perspective 
in including the item nonresponse process. Therefore, it is expected to be double protected against 
imputation model misspecification. In particular, it is enough that either the imputation model 
assumed to obtain predicted values y^j or the item response probability model for qkj is correctly 
specified in order to achieve consistency. This is particularly relevant in our context in which no 
particular emphasis is put on the choice of the imputation model. 

4 Simulation results 

We evaluate the performance of the estimators presented in Section 3.3 by means of Monte Carlo 
simulations under two different settings. The first one uses a real data set as the population and 
considers variables of interest that are all binary, while the second one uses simulated population 
data with variables of interest that are all continuous. Results from the first setting are presented 
in Section 4.1, while those from the second setting are presented in Section 4.2. 
In both settings the following estimators have been considered. 

• HT = J2kes Ukj/^k- the Horvitz-Thompson estimator in the case of full response has been 
computed as a benchmark for bias and variance in absence of nonresponse. It reduces to 

14 



TV times the sample mean under srs without replacement (the design employed in both 
simulation settings). 

• Yj, naive'- the naive estimator in (14) has been computed as a benchmark for the other esti- 
mators, in which no explicit action is taken to adjust for unit and item nonresponse. Note 
that for srs without replacement, it reduces to Y^ na ive = ^J2ker y Vkj/ n r yl where n Ty is the 
size of the set r y , and it is the same as the Horvitz-Thompson estimator adjusted for unit 
nonresponse that assumes uniform response probabilities estimated by n Ty jn. 

• ^"nSvo 1 tne na i ye estimator that uses imputed values to treat item nonresponse considered 
in (15). 

• Yj^ pq : the three-phase estimator proposed in Section 3.3, equation (16). 

• Yj™: the three-phase estimator that uses the true values for the response probabilities pt and 
qkj has been also computed for comparison with Yj tPq to understand the effect of estimating 
the response probabilities. 

• Y™^: the two-phase estimator that adjusts for item nonresponse via imputation proposed in 
(17). 

• Y™^: the model assisted compromise proposed in (18). 

• Y™^ c : the estimator proposed by Laaksonen and Chambers (2006) and given by in our 
context by 

^ L p c = £jb+Ejb> as) 

k£r y nkP k ker\r y ^ 

where p^ c is an estimator of the response probability defined as p^ c = P(Rk = Mvtj) that 
accounts for non ignorable nonresponse using imputed values. Note that it takes the same 
form of estimator Y™^ v in (17), the only difference lays in the way in which the response 
probabilities pk are estimated. 

The simulations are carried out in R version 2.15, using the R package ltm (Rizopoulos, 
2006) to fit the latent models. The following performance measures have been computed for each 
estimator, generically denoted below by Y where suffix j is dropped for ease of notation: 
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the Monte Carlo Absolute Bias 

AB = E sim (Y) — Y, 

where 

M 

E sim (Y) = J2Yi/M, 
i=i 

Yi is the value of the estimator Y at the i-th simulation run and M is total number of 
simulation runs and 

the Relative Bias 

AB 

RB = — ; 

the Monte Carlo Standard Deviation 



, M 2 

\ 4 = 1 



• the Monte Carlo Mean Squared Error 

MSE = AB 2 + VAR; 

• the Bias-to-Standard error Ratio 

BSR=^=L. 
VVAR 

4.1 Simulation setting 1 

We consider the Abortion data set formed by four binary variables extracted from the 1986 British 
Social Attitudes Survey and concerning the attitude to abortion. N = 379 individuals answered to 
the following questions after being asked if the law should allow abortion under the circumstances 
presented under each item: 

1. The woman decides on her own that she does not wish to keep the baby. 

2. The couple agree that they do not wish to have a child. 

3. The woman is not married and does not wish to marry the man. 

4. The couple cannot afford any more children. 
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The variable of interest yj is selected to be the second one (j — 2) with a total Yj = 225 in the 
population, that is taken to be our parameter of interest. 

The data is analyzed by Bartholomew et al. (2002) as an example in which a latent variable 
can be found that measures the attitude to abortion. At the population level we have computed on 
the {yke}k=i,...,N;i=i,...,4 data such latent variable (denoted here by 6%) using the 2PL model in (9). 
The correlation between the values y k i and 6 k is approximatively equal to 0.85, for I = 1, . . . , 4. 
Afterwards, we have set 6 k = 6 k , for all k = 1, . . . , N. 

At the population level, the unit response probabilities are generated using the following 
response model 

Pk = 1/(1 + cxp(-(0.7 + y k2 + k + 0.2e fe ))), (20) 

with e k <~ U(0, 1), to simulate non ignorable nonresponse. The population mean of p k is approxi- 
mately 0.74. 

To generate item response probabilities at the population level the following model is used 

gjw = l/(l + exp(-(6/(9fc + o/ + y w ))), for£=l,...,4 (21) 

where bf = 3, for £ = 1, . . . , 4, while an takes different values according to £; in particular, a\ = 1, 
a 2 = 0, a 3 = —0.5 and a± = 1. The nominal item nonresponse rate for the four items in the 
population is 35%, 42%, 47%, 31%, respectively. 

We draw M = lO'OOO simple random samples without replacement (n k = n/N) from the 
population using two sample sizes: n = 50 and n = 100. In each sample s the units have been 
classified as respondents according to Poisson sampling, using the probabilities p k computed as in 
equation (20) and resulting in the set r. Then, given r, the matrix {x k e}ker;e=i,...,4 1S constructed 
where the values x k i are drawn using Poisson sampling with probabilities q k t defined in (21). 

In each simulation run, Model (9) and the respondents set r are used to compute the variable 
6 k for all fc e s. Model (12) is fitted to obtain p k . To avoid the separation problem in fitting Model 
(12) we iteratively add to each sample s new observations (R k , 9 k ) until convergence, with R k = 
and 6 k <~ iV(0, 1). These observations are afterwards deleted from the data set. The maximum 
number of added observations in simulations is 10 for n = 50 and 7 for n = 100. The average item 
nonresponse rate over simulations for the four items is found to be 26%, 33%, 38% and 23%. 
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Table 1: Simulation results for setting 1 - Abortion data set - n = 50 



Estimator 


AB 


VVAR 


MSE 


% RB 


% BSR 


HT 


0.6 


24.6 


603.3 


0.3 


2.5 


■^j, naive 


127.1 


19.3 


16531.1 


56.5 


658.3 


yimp 
j naive 


34.4 


33.6 


2311.4 


15.3 


102.4 


Y 


-4.5 


53.9 


2921.9 


-2.0 


-8.3 


Y 


0.5 


35.0 


1226.5 


0.2 


1.5 


3,P 


-12.7 


48.3 


2491.4 


-5.6 


-26.3 


yimp 

j,PQ 


-0.7 


59.0 


3479.4 


-0.3 


-1.2 


V- im P 


-27.8 


33.1 


1866.0 


-12.3 


-84.0 


2: Simulation results for settinj 


I 1 - Abortion data set - n 


Estimator 


AB 


VVAR 


MSE 


% RB 


% BSR 


HT 


0.1 


16.0 


254.5 


0.1 


0.3 


Yj i naive 


126.9 


13.4 


16284.9 


56.4 


945.0 


yimp 
naive 


36.1 


23.5 


1855.0 


16.0 


153.8 


Y 


-7.9 


67.3 


4588.1 


-3.5 


-11.8 




0.2 


23.7 


563.4 


0.1 


0.8 


yimp 

j,P 


-17.9 


33.6 


1449.5 


-7.9 


-53.2 


yimp 

j,P9 


-5.5 


44.4 


2000.9 


-2.5 


-12.5 




-27.4 


22.5 


1255.6 


-12.2 


-122.0 



The imputed values y* k2 (binary values) used by estimators Y™l We , Y™ p , and Y™[ c 

are computed according to Poisson sampling with predicted probabilities computed by Model (9), 
for all k € r y . Tables 1 and 2 give the results for n = 50 and n = 100, respectively. 

As expected, HT and ^, p(J ,true have almost zero bias, with the second one showing a relatively 
larger MSE that is due uniquely to the smaller sample size. The naive estimator shows a very large 
positive bias. This is due to the fact that units with a zero value of yj are less likely to respond and 
the total is clearly overestimated. This bias is reduced by l^aive' by meanm g that the imputation 
model employed is relatively good in describing item nonresponse. The proposed estimators all 
reduce significantly the bias compared to the two aforementioned estimators, with Y™^ providing 
the best performance, meaning that the unit and item response probability models are able to 
capture the attitude to respond to the questionnaire that is linked with the attitude towards 
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abortion. In particular, Yj tPq and Y-^ show a very good coverage measured by BSR in the last 
column, meaning that the model for the item response probabilities is better specified than the 
imputation model. This fact also affects the not very good performance of Y-™£ c that bases the 
model for the response probabilities p^s on the imputation model. In general, it can be noted 
that estimators that use imputation tend to have a smaller variance (like a shrinkage effect) that 
influences coverage. Note that the performance of all estimators but HT and Yj tPqttIue , is mostly 
driven by the absolute bias, so that the performance is not relatively different when increasing the 
sample size, apart from a reduction in coverage given by the decrease in variance. 

To study the performance of the latent model on this data set and the correlation between 
the variable of interest and the estimated latent variable we apply the procedure described before 
to construct the matrix {xke}k=i,...,N-,e=i,...,4 for all population units. On this dataset, we fit the 
2PL model in (9) and compute the variable 6^ for all k = 1, . . . , N. The Cronbach's alpha measure 
takes value 0.83 showing a good internal consistency of the items. The x 2 p-values for pairwise 
associations between the four items are smaller than 0.01. These p-values correspond to the 2x2 
contingency tables for all possible pairs of items. Inspection of non significant results can be used to 
reveal 'problematic' items. Latent variable models assume that high association between items can 
be explained by a set of latent variables. Thus, for pairs of items that do not reject independence 
we could say that they violate this assumption. 

The correlation coefficient between the variable of interest and the estimated latent variable 
takes value 0.76, indicating that the latent auxiliary information has a powerful power of predicting 
yk2, as advocated in the model of Cassel et al. (1983). The inspection of the two-way margins for 
the matrix {xkt} gives the residuals (O — E) 2 /E between 0.03 and 0.23. Similarly the three-way 
margins for the matrix {x^t} give residuals between and 1.19. This indicates that we have no 
reason to reject here the one- factor latent Model (9). This decision is based on the fact that if 
we consider the residual in each cell as having a \ 2 distribution with one degree of freedom, then 
a value of the residual greater than 4 is indicative of poor fit at the 5% significance level (see 
Bartholomew et al., 2002, p. 186). 
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4.2 Simulation setting 2 

We generate {y k i, ■ ■ ■ , Vk6, 9k} for k = 1, . . . , N = 2'000 using a multivariate normal distribution. 
The degree of correlation between ye and ye> is 0.8, with £, £' = 1, . . . , 6, 1 ^ £'. We set the variable 
of interest to be y\ and consider different degrees of correlation between the values y k i and 9k' 
namely, 0.3, 0.5, 0.8. The values of 9 k are afterwards standardized to have mean and variance 1. 
The response probabilities are obtained by first computing 

p2 = l/(l + exp(-(O.5 + y fc i+0 fc ))), for k= 1,...,N, (22) 

and then rescaling them to take values between 0.1 and 0.9 using the transformation 

Pk = (P° k - mmpfc)/(max^ - mmp° k ) x 0.8 + 0.1, (23) 

with a population mean approximatively equal to 0.7. 

The item response probabilities are generated by first computing 

q o k e = l/(l + exp(-(be0 k + ae + y k e))), for k = 1, . . . , AT and t = 1, . . . , 6, (24) 

where {ae}e=i,...,6 = {1, 0, —0.5, 1, 0, —0.5} and {be}e=i,...,6 = {1, 1, 1, 1-5, 1.5, 1.5}, and then rescal- 
ing the values to be between 0.1 and 0.95 using the transformation 

Qkt = {(lit - m , in 9«)/(maxg^ - ming^) x 0.85 + 0.1. (25) 

k k k 

We draw M = lO'OOO samples by simple random simple without replacement of dimension 
n = 200. For each sample s, a response set r is created by carrying out Poisson sampling with 
parameter p k . Each element of the matrix {xke}ker,e=i,...,6 IS generated using Poisson sampling 
with parameter q^e- Item nonresponse rates over simulations take approximately value 12%, 20%, 
25%, 14%, 22%, 26%, for I = 1, . . . , 6, respectively. 

For each simulation run, the 2PL model in (9) is used to compute the variable 0k for all fees. 
Model (12) is then fitted to obtain p k - In this simulation setting there has been no need to add 
randomly generated values to avoid the separation problem in fitting Model (12). 

The imputed values y kl (continuous in this case) to be used by estimators Y-™^ ivc , Yj™ p , 
Yj™q an d ^'™lc nave been computed using the regression imputation method. In particular, the 
following linear regression model 

Vk\ =a + ai#fc+£fc, e k -N(0,a 2 ) (26) 
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Table 3: Simulation results for setting 2 - Simulated continuous data - correlation coefficient 0.3 



Estimator 


A"R 


V VArt 


JVlorj 


/O rU3 


0/ TJQO 

/o rSort 




0.4 


1 1 o 

131.8 


17360.33 


<0.0 


0.3 


1 j, naive 


392.2 


149.4 


1 761 59 8 


9 Q 


262.5 


yimp 
j, naive 


Q77 7 


1^17 
101. I 




y.o 


0/1 ft 


Y 


-27.3 


472.7 


224187.0 


0.7 


-5.8 




0.2 


222.1 


49323.2 


<0.0 


0.1 


yimp 

3,P 


89.3 


398.8 


167027.6 


2.2 


22.4 


yimp 

j,pq 


82.0 


393.8 


161781.3 


2.1 


20.8 


T>imp 


296.0 


219.7 


135877.58 


7.4 


134.7 



Table 4: Simulation results for setting 2 - Simulated continuous data - correlation coefficient 0.5 



Estimator 


AB 


VVAR 


MSE 


% RB 


% BSR 


HT 


0.8 


136.1 


18514.7 


<0.0 


0.6 


Y ■ 

1 J, naive 


483.5 


152.2 


256972.7 


12.1 


317.6 


yimp 
j, naive 


439.8 


160.0 


219048.5 


11.0 


274.9 


Y 


-116.3 


520.4 


284354.1 


-2.9 


-22.4 


Y 


6.5 


223.7 


50080.7 


0.2 


2.9 


3,P 


14.3 


475.1 


225957.74 


0.4 


3.0 


yimp 
3,P1 


0.6 


463.0 


214326.4 


<0.0 


0.1 




229.2 


328.2 


160281.0 


5.7 


69.8 



is fitted on the set r y . The imputed values are given by the predicted values of Model (26) for 
given 9k 1 k £ r\r y . Tables 3, 4, and 5 give the performance of the estimators for the three values 
taken by the nominal correlation coefficient between yki and 9k - 0.3, 0.5, and 0.8, respectively. 

The proposed estimators are always able to reduce the bias over the naive estimators, even 
when the correlation between the variable of interest and the latent variable gets smaller. The 
relative bias takes acceptable values in most cases, while the ratio between the bias and the standard 
error gets in some instances larger than 20%, providing some concern on coverage. The absolute 
bias deserves a closer look. The naive estimators in all cases largely overestimate the total. This is 
expected, because the values pk, qtj, 9k and y^x all go in the same direction. Therefore we have in 
our respondents sample more likely relative larger values for yi by this providing overestimation for 
the naive estimators. The imputation model uses the relationship between yki and 9k and, therefore, 
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Table 5: Simulation results for setting 2 - Simulated continuous data - correlation coefficient 0.8 



Estimator 


AB 


VVAR 


MSE 


% RB 


% BSR 


HT 


-2.0 


136.3 


18578.1 


<0.0 


-1.5 


Yj_ naive 


535.5 


147.8 


308568.0 


13.2 


362.2 


yimp 

j, naive 


447.6 


166.8 


228191.4 


11.1 


268.3 


Y 


-220.4 


531.8 


331350.2 


-5.5 


-41.4 


v 


8.3 


220.6 


48738.8 


0.2 


3.8 


yimp 

3,P 


-111.5 


532.9 


296441.1 


-2.8 


-20.9 


yimp 

j,pq 


-132.4 


510.2 


277853.3 


-3.3 


-25.9 


rMmp 


23.6 


480.1 


231081.5 


0.6 


4.9 



helps Yj lm aive to decrease the AB, and this gets relatively larger as the level of the correlation 
between yki and 9k increases. On the other hand, Y^ pq underestimates the total because it is based 
only on the observed units of r y that do have relatively large values for yi, but also relatively 
large values for p and qi and, therefore, end up having a small weight. This underestimation gets 
larger and larger as the correlation between 9k and yki (and therefore, between 9k and the response 
probabilities) increases. The use of the imputation model in estimators Y™ p and Y™^ helps in 
adjusting by excess this estimate, with the latter one being a good compromise. In fact, Y™^ 
provides more often a better performance among the three. The Y™^ c also shows a good behavior 
in reducing the bias, although its performance is very much linked to the degree of the correlation 
between the variable of interest and the latent variable (therefore it behaves worse as it decreases). 

The matrix of population values {xke}k=i,... ,2000^=1,. ..,6 has been constructed in the same way 
as in Section 4.1 to validate the assumptions behind the 2PL model. The Cronbach's alpha takes 
approximately value 0.5 for the correlation coefficient equal to 0.3, 0.6 for 0.5, and 0.7 for 0.8; the 
pairwise association between the six items reveales p-values smaller than 0.01. The inspection of 
the two-way and three-way margins of the matrix {xkt} gives residuals (O — E) 2 / E that all take 
values smaller than 4. Therefore, the one factor latent model can be accepted and items seem to 
be all measuring the same latent trait. 

To test what happens to estimators Y™^ ive , Yj™ p , Y"^ and Yj™l c if we use a different impu- 
tation model, we run another simulation. To this end, we use the simulated population data where 
the correlation coefficient between yki and 9k is 0.8. The imputed values y* kl are now computed 
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Table 6: Simulation results for setting 2 - Simulated continuous data - correlation coefficient 0.8 



Estimator 


AB 


VVAR 


MSE 


% RB 


% BSR 


HT 


O.f 


137.9 


19016.2 


<0.0 


< 0.0 


X?', naive 


558.8 


149.3 


334524.8 


14.0 


374.3 


yimp * 

j, naive 


558.8 


149.3 


334524.8 


14.0 


374.3 


Y 


-237.8 


534.2 


341952.7 


6.0 


-44.5 


v 

1 j,pq,true 


5.4 


220.0 


48429.1 


0.1 


2.4 


Y ini P * 

3,P 


80.6 


405.4 


170841.8 


2.0 


19.9 




27.2 


404.5 


164319.4 


0.7 


6.7 


V im P * 


558.8 


149.3 


334524.8 


14.0 


374.3 



* Mean imputation model used to obtain y* values. 



using the mean of all respondents of the set r y . i.e. a neutral imputation model. Table 6 gives the 
results of this simulation. It is expected that the naive estimators and ^™lc take the same value 
in this case. Note also that such values are similar to that of the naive estimator in Table 5; this 
is true also for Yj, pq . In fact, differences in these cases stem only from simulation error. Estimators 
y"p P and Y"pq improve their performance over that of Table 5, because they use an imputation 
model that predicts y* values that are relatively larger (recall that units in the r y set have relatively 
larger values of y) and, therefore, help reducing the underestimation of estimator Yj^ pq in this case. 
It is therefore, very important, to take into consideration the correlation structure existing among 
p, q, 9 and y to try to take the best out of it. 

Finally we explore in another simulation the robustness of the proposed method to the case 
in which 9k does not follow a normal distribution. Population data is generated like in the last 
simulation. Then, the values 9k are transformed to follow a skew distribution by adding a x 2 
random variable with one degree of freedom. This provides a new variable 9k with skewness equal 
to 1.54. The correlation coefficient between the variable of interest and the obtained latent variable 
is about 0.5. 

At the population level, the unit response patterns are generated by the following equation 

Pk = 1/(1 + exp(-(0.7 + V ki - O.30 fc ))), (27) 

to achieve a population mean approximatively equal to 0.82. The item response probabilities are 
generated as in (24) and then rescaled to the range 0.1 and 0.75. The items' nonresponse rate 
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Table 7: Simulation results for setting 2 - Simulated continuous data - skew distribution for 9k, 
correlation coefficient 0.5 



Estimator 


AB 


VVAR 


MSE 


RB 

(in %) 


BSR 

(in %) 


HT 


-13.3 


181.8 


33227.0 


-0.3 


-7.3 


Y ■ 

1 J, naive 


266.6 


248.9 


133017.0 


6.7 


107.1 


yimp 
j, naive 


259.2 


248.2 


128796.9 


6.5 


104.4 


Y 


-104.3 


416.3 


184162.8 


-2.6 


-25.1 




12.0 


315.0 


99370.5 


0.3 


3.8 


yimp 
hV 


142.6 


256.2 


85986.3 


3.6 


55.7 


yimp 

j,VQ 


144.0 


255.1 


85829.5 


3.6 


56.4 




242.9 


249.1 


121078.4 


6.1 


97.5 



means in simulations take values between 25% and 29%. Imputed values for estimators Yj™^ ive , 
Y*™ p , Y"pq and Y"^l c are computed using the model of equation (26). 

Table 7 gives the performance estimators for this simulation setting. As in the previous simu- 
lations the proposed estimators perform better than the naive estimators in terms of bias. It seems 
that the skewness of the distribution of 9k does not have a negative influence on the behavior of 
the proposed estimators. Estimator Yj tPq shows the best performance with the lowest bias and a 
good coverage. On the other hand, there is some concern on coverage for those estimators based 
on imputation, for which the standard errors are relatively lower, but bias, although smaller than 
for the naive estimators, is still quite high. 

5 Conclusions 

We have proposed a reweighting system to compensate for unit non-ignorable nonresponse based 
on a latent auxiliary variable. This variable is computed for each unit in the sample using a 
latent model assuming the existence of item nonresponse and that the same latent structure is 
hidden behind item and unit nonresponse. Unit response probabilities are then estimated by a 
logistic model that uses as covariate the latent trait extracted by the response patterns using Item 
response theory models. The proposed reweighting system is then used in different estimators, 
assuming imputation, reweighting or a combination of the two to handle item nonresponse. The 
main goal is to reduce the nonresponse bias in the estimation of the population total. The proposed 
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estimators perform well in our simulation studies comparing to the naive estimators, and the gain in 
efficiency is substantial in certain cases. Reductions in bias are displayed also when the correlation 
between the latent trait and the variable of interest is modest. 

It is also interesting to study the trade-off between the reduction of nonresponse bias and the 
increase in variance. By construction the estimated latent variable Ok is related to the response 
indicators x k j for the variable of interest yj] since nonresponse is assumed to be non-ignorable, y k j 
and related as well. If the following condition holds, 

P%, Xj + p\ x . > 1, 

where the correlation coefficients p Vj tXj , Pg x . > 0, then yj and 9 are positively correlated (see Lang- 
ford et al., 2001). Following Little and Vartivarian (2005), 'a covariate for a weighting adjustment 
must have two characteristics to reduce nonresponse bias - it needs to be related to the probability 
of response, and it needs to be related to the survey outcome. If the latter is true, then weighting 
can reduce, not increase, sampling variance.' A positive correlation between yj and 9 can thus 
provide also reduction of the variance of the proposed estimators. 

We have considered the case in which no auxiliary information is available at the sample 
or population level to reduce nonresponse bias. Observed covariates (if available) and the latent 
variable 9 k can be, however, used together in the estimation of the response probabilities. Moreover, 
latent trait models can be fitted with covariates. The introduction of covariates in these models 
should be carried out with the prudence of variance increasing. 
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